474 research outputs found

    Decoding billions of integers per second through vectorization

    Get PDF
    In many important applications -- such as search engines and relational database systems -- data is stored in the form of arrays of integers. Encoding and, most importantly, decoding of these arrays consumes considerable CPU time. Therefore, substantial effort has been made to reduce costs associated with compression and decompression. In particular, researchers have exploited the superscalar nature of modern processors and SIMD instructions. Nevertheless, we introduce a novel vectorized scheme called SIMD-BP128 that improves over previously proposed vectorized approaches. It is nearly twice as fast as the previously fastest schemes on desktop processors (varint-G8IU and PFOR). At the same time, SIMD-BP128 saves up to 2 bits per integer. For even better compression, we propose another new vectorized scheme (SIMD-FastPFOR) that has a compression ratio within 10% of a state-of-the-art scheme (Simple-8b) while being two times faster during decoding.Comment: For software, see https://github.com/lemire/FastPFor, For data, see http://boytsov.info/datasets/clueweb09gap

    GN-SCCA: GraphNet based Sparse Canonical Correlation Analysis for Brain Imaging Genetics

    Get PDF
    Identifying associations between genetic variants and neuroimaging quantitative traits (QTs) is a popular research topic in brain imaging genetics. Sparse canonical correlation analysis (SCCA) has been widely used to reveal complex multi-SNP-multi-QT associations. Several SCCA methods explicitly incorporate prior knowledge into the model and intend to uncover the hidden structure informed by the prior knowledge. We propose a novel structured SCCA method using Graph constrained Elastic-Net (GraphNet) regularizer to not only discover important associations, but also induce smoothness between coefficients that are adjacent in the graph. In addition, the proposed method incorporates the covariance structure information usually ignored by most SCCA methods. Experiments on simulated and real imaging genetic data show that, the proposed method not only outperforms a widely used SCCA method but also yields an easy-to-interpret biological findings

    Exact Hybrid Covariance Thresholding for Joint Graphical Lasso

    Full text link
    This paper considers the problem of estimating multiple related Gaussian graphical models from a pp-dimensional dataset consisting of different classes. Our work is based upon the formulation of this problem as group graphical lasso. This paper proposes a novel hybrid covariance thresholding algorithm that can effectively identify zero entries in the precision matrices and split a large joint graphical lasso problem into small subproblems. Our hybrid covariance thresholding method is superior to existing uniform thresholding methods in that our method can split the precision matrix of each individual class using different partition schemes and thus split group graphical lasso into much smaller subproblems, each of which can be solved very fast. In addition, this paper establishes necessary and sufficient conditions for our hybrid covariance thresholding algorithm. The superior performance of our thresholding method is thoroughly analyzed and illustrated by a few experiments on simulated data and real gene expression data

    A Regularized Graph Layout Framework for Dynamic Network Visualization

    Full text link
    Many real-world networks, including social and information networks, are dynamic structures that evolve over time. Such dynamic networks are typically visualized using a sequence of static graph layouts. In addition to providing a visual representation of the network structure at each time step, the sequence should preserve the mental map between layouts of consecutive time steps to allow a human to interpret the temporal evolution of the network. In this paper, we propose a framework for dynamic network visualization in the on-line setting where only present and past graph snapshots are available to create the present layout. The proposed framework creates regularized graph layouts by augmenting the cost function of a static graph layout algorithm with a grouping penalty, which discourages nodes from deviating too far from other nodes belonging to the same group, and a temporal penalty, which discourages large node movements between consecutive time steps. The penalties increase the stability of the layout sequence, thus preserving the mental map. We introduce two dynamic layout algorithms within the proposed framework, namely dynamic multidimensional scaling (DMDS) and dynamic graph Laplacian layout (DGLL). We apply these algorithms on several data sets to illustrate the importance of both grouping and temporal regularization for producing interpretable visualizations of dynamic networks.Comment: To appear in Data Mining and Knowledge Discovery, supporting material (animations and MATLAB toolbox) available at http://tbayes.eecs.umich.edu/xukevin/visualization_dmkd_201

    Noncommutative geometry inspired black holes in higher dimensions at the LHC

    Full text link
    When embedding models of noncommutative geometry inspired black holes into the peridium of large extra dimensions, it is natural to relate the noncommutativity scale to the higher-dimensional Planck scale. If the Planck scale is of the order of a TeV, noncommutative geometry inspired black holes could become accessible to experiments. In this paper, we present a detailed phenomenological study of the production and decay of these black holes at the Large Hadron Collider (LHC). Noncommutative inspired black holes are relatively cold and can be well described by the microcanonical ensemble during their entire decay. One of the main consequences of the model is the existence of a black hole remnant. The mass of the black hole remnant increases with decreasing mass scale associated with noncommutative and decreasing number of dimensions. The experimental signatures could be quite different from previous studies of black holes and remnants at the LHC since the mass of the remnant could be well above the Planck scale. Although the black hole remnant can be very heavy, and perhaps even charged, it could result in very little activity in the central detectors of the LHC experiments, when compared to the usual anticipated black hole signatures. If this type of noncommutative inspired black hole can be produced and detected, it would result in an additional mass threshold above the Planck scale at which new physics occurs.Comment: 21 pages, 7 figure

    Holographic Lovelock Gravities and Black Holes

    Get PDF
    We study holographic implications of Lovelock gravities in AdS spacetimes. For a generic Lovelock gravity in arbitrary spacetime dimensions we formulate the existence condition for asymptotically AdS black holes. We consider small fluctuations around these black holes and determine the constraint on Lovelock parameters by demanding causality of the boundary theory. For the case of cubic Lovelock gravity in seven spacetime dimensions we compute the holographic Weyl anomaly and determine the three point functions of the stress energy tensor in the boundary CFT. Remarkably, these correlators happen to satisfy the same relation as the one imposed by supersymmetry. We then compute the energy flux; requiring it to be positive is shown to be completely equivalent to requiring causality of the finite temperature CFT dual to the black hole. These constraints are not stringent enough to place any positive lower bound on the value of viscosity. Finally, we conjecture an expression for the energy flux valid for any Lovelock theory in arbitrary dimensions.Comment: 31 pages, 1 figure, harvmac, references added, calculation of viscosity/entropy ratio include

    Relaxed 2-D Principal Component Analysis by LpL_p Norm for Face Recognition

    Full text link
    A relaxed two dimensional principal component analysis (R2DPCA) approach is proposed for face recognition. Different to the 2DPCA, 2DPCA-L1L_1 and G2DPCA, the R2DPCA utilizes the label information (if known) of training samples to calculate a relaxation vector and presents a weight to each subset of training data. A new relaxed scatter matrix is defined and the computed projection axes are able to increase the accuracy of face recognition. The optimal LpL_p-norms are selected in a reasonable range. Numerical experiments on practical face databased indicate that the R2DPCA has high generalization ability and can achieve a higher recognition rate than state-of-the-art methods.Comment: 19 pages, 11 figure

    Alleviating the non-ultralocality of coset sigma models through a generalized Faddeev-Reshetikhin procedure

    Get PDF
    The Faddeev-Reshetikhin procedure corresponds to a removal of the non-ultralocality of the classical SU(2) principal chiral model. It is realized by defining another field theory, which has the same Lax pair and equations of motion but a different Poisson structure and Hamiltonian. Following earlier work of M. Semenov-Tian-Shansky and A. Sevostyanov, we show how it is possible to alleviate in a similar way the non-ultralocality of symmetric space sigma models. The equivalence of the equations of motion holds only at the level of the Pohlmeyer reduction of these models, which corresponds to symmetric space sine-Gordon models. This work therefore shows indirectly that symmetric space sine-Gordon models, defined by a gauged Wess-Zumino-Witten action with an integrable potential, have a mild non-ultralocality. The first step needed to construct an integrable discretization of these models is performed by determining the discrete analogue of the Poisson algebra of their Lax matrices.Comment: 31 pages; v2: minor change

    Four-Dimensional SCFTs from M5-Branes

    Full text link
    We engineer a large new set of four-dimensional N=1 superconformal field theories by wrapping M5-branes on complex curves. We present new supersymmetric AdS_5 M-theory backgrounds which describe these fixed points at large N, and then directly construct the dual four-dimensional CFTs for a certain subset of these solutions. Additionally, we provide a direct check of the central charges of these theories by using the M5-brane anomaly polynomial. This is a companion paper which elaborates upon results reported in arXiv:1112:5487.Comment: 45 pages, 11 figure

    Correlators of Vertex Operators for Circular Strings with Winding Numbers in AdS5xS5

    Full text link
    We compute semiclassically the two-point correlator of the marginal vertex operators describing the rigid circular spinning string state with one large spin and one windining number in AdS_5 and three large spins and three winding numbers in S^5. The marginality condition and the conformal invariant expression for the two-point correlator obtained by using an appropriate vertex operator are shown to be associated with the diagonal and off-diagonal Virasoro constraints respectively. We evaluate semiclassically the three-point correlator of two heavy circular string vertex operators and one zero-momentum dilaton vertex operator and discuss its relation with the derivative of the dimension of the heavy circular string state with respect to the string tension.Comment: 16 pages, LaTeX, no figure
    corecore